Estimating Missing Data Using Neural Network Techniques, Principal Component Analysis and Genetic Algorithms

نویسندگان

  • Abdul K. Mohamed
  • Fulufhelo V. Nelwamondo
  • Tshilidzi Marwala
چکیده

The common problem of missing data in databases is being dealt with, in recent years, through estimation methods. Auto-associative neural networks combined with genetic algorithms have proved to be a successful approach to missing data imputation. Similarly, two new auto-associative models are developed to be used along with the Genetic Algorithm to estimate missing data and these approaches are compared to a regular auto-associative neural network and Genetic algorithm approach. One method combines three neural networks to form a hybrid auto-associative network, while the other merges Principle Component Analysis and neural networks. The hybrid network and Genetic Algorithm approach proves most accurate, when estimating one missing value, while the PCA and neural network version is more consistent and captures patterns in the data most efficiently, in the chosen application.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigation into the use of Autoencoder Neural Networks, Principal Component Analysis and Support Vector Regression in estimating missing HIV data

Data collection often results in records that have missing values or variables. This investigation compares 3 different data imputation models and identifies their merits by using accuracy measures. Autoencoder Neural Networks, Principal component analysis and Support Vector regression are used for prediction and combined with a genetic algorithm to then impute missing variables. The use of PCA...

متن کامل

Comparative Analysis of Neural Network Training Methods in Real-time Radiotherapy

Background: The motions of body and tumor in some regions such as chest during radiotherapy treatments are one of the major concerns protecting normal tissues against high doses. By using real-time radiotherapy technique, it is possible to increase the accuracy of delivered dose to the tumor region by means of tracing markers on the body of patients.Objective: This study evaluates the accuracy ...

متن کامل

Combined Unfolded Principal Component Analysis and Artificial Neural Network for Determination of Ibuprofen in Human Serum by Three-Dimensional Excitation–Emission Matrix Fluorescence Spectroscopy

This study describes a simple and rapid approach of monitoring ibuprofen (IBP). Unfolded principal component analysis-artificial neural network (UPCA-ANN) and excitation-emission spectra resulted from spectrofluorimetry method were combined to develop new model in the determination of IBF in human serum samples. Fluorescence landscapes with excitation wavelengths from 235 to 265 nm and emission...

متن کامل

Combined Unfolded Principal Component Analysis and Artificial Neural Network for Determination of Ibuprofen in Human Serum by Three-Dimensional Excitation–Emission Matrix Fluorescence Spectroscopy

This study describes a simple and rapid approach of monitoring ibuprofen (IBP). Unfolded principal component analysis-artificial neural network (UPCA-ANN) and excitation-emission spectra resulted from spectrofluorimetry method were combined to develop new model in the determination of IBF in human serum samples. Fluorescence landscapes with excitation wavelengths from 235 to 265 nm and emission...

متن کامل

Learning Bayesian Network Structure Using Genetic Algorithm with Consideration of the Node Ordering via Principal Component Analysis

‎The most challenging task in dealing with Bayesian networks is learning their structure‎. ‎Two classical approaches are often used for learning Bayesian network structure;‎ ‎Constraint-Based method and Score-and-Search-Based one‎. ‎But neither the first nor the second one are completely satisfactory‎. ‎Therefore the heuristic search such as Genetic Alg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007